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In this paper, we consider a d-dimensional continuous Ito process which is observed at n regularly 
spaced times on a given time interval [0,T]. This process is driven by a multidimensional Wiener 
process and our aim is to provide asymptotic statistical procedures which give the minimal 
dimension of the driving Wiener process, which is between (a pure drift) and d. We exhibit 
several different procedures, all similar to asymptotic testing hypotheses. 
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1. Introduction 

In numerous applications, one chooses to model a complex dynamical phenomenon by 
stochastic differential equations or, more generally, by semimartingales, either because 
random forces excite a mechanical system, because time-dependent uncertainties disturb 
a deterministic trend or because one aims to reduce the dimension of a large-scale sys- 
tem by considering that some components contribute stochastically to the evolution of 
the system. Respective examples of applications are mechanical oscillators submitted to 
random loading, prices of financial assets and molecular dynamics. 

Of course, the calibration of the model is a crucial issue. A huge literature deals with 
the statistics of stochastic processes, particularly of diffusion processes. Parametric and 
nonparametric estimators of the coefficients of stochastic differential equations have been 
intensively studied; sec, for example, the books [6] and [7] of Prakasa Rao, in which a 
large number of papers are quoted and analyzed. However, somewhat astonishingly, it 
seems to us that most of the papers consider that the dimension of the noise is known 
by the observer. This hypothesis is often questionable: there is no reason to a priori fix 
this dimension when one observes a basket of assets or a complex mechanical structure 
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in a random environment. Actually, the last two authors of this paper were motivated to 
study this question by modelling and simulation issues related to the pricing of contracts 
based on baskets of energy prices (see O. Bardou's thesis [3]). There was no determining 
financial reason to fix the Brownian motion dimension to a particular value. In addition, 
the interest in finding a dimension as small as possible was twofold: first, one then avoids 
the calibration of useless diffusion matrix components; second, practitioners need the 
simulation of the model, and thus the computation of contract prices and of corresponding 
risk measures by means of Monte Carlo simulations, to be as rapid as possible. 

We thus try, in this paper, to tackle the question of estimating the Brownian dimension 
of an Ito process from the observation of one trajectory during a finite time interval. More 
precisely, we aim to build estimators which provide an 'explicative Brownian dimension' 
r^: a model driven by an rs Brownian motion satisfyingly fits the information conveyed 
by the observed path, whereas increasing the Brownian dimension does not allow to fit 
the data any better. Stated this way, the problem is obviously ill-posed, hence our first 
step consists of defining a reasonable framework to develop our study. 

Suppose that we observe a continuous c?-dimcnsional Brownian scmimartingale X = 
(X*)i<i<d on some space (fi, J^, (J^t), P). The observation time interval is [0,T] with T 
finite. The process X is a continuous Ito process, meaning that it satisfies the following 
assumption. 

Hypothesis (H). We have 



where is a standard (/-dimensional Brownian motion (BM), a is a predictable R**- 
valued locally bounded process and cr is a d x g-matrix- valued adapted and cadlag process 
(in (1), one can replace Us by the left limit CTs- so as to have a predictable integrand, if 
one wishes). 

We say that we are in the 'pure diffusion case' when as = cr{Xs)- We set Cg = (JsCr* 
(so Cg = c{Xs), where c = aa* in the pure diffusion case). The process c takes its values 
in the set Add of all d x d symmetric non-negative matrices. We denote by rank(I]) the 
rank of any E e Aid- 
As is well known, the same process X can be written as (1) with many different Wiener 
processes; namely, if (11^) is a progressively measurable process taking its values in the 
set oi q X q orthogonal matrices, then Wl = J^^ Us dWg is another g-dimensional Wiener 
process and X is of the form (1) with W' and a'^ = asTlJ^. The 'Brownian dimension' 
tb oi our model is then defined as being the smallest integer r such that, after such 
a transformation, the last q — r columns of ct^oj) vanish (outside a P(da;) ds-nuU set, 
and for s < T, of course). In this case, we can forget about the last g — ?' components 
of W and, in fact, write (1) with an 7'-dimensional Wiener process. Obviously, tb < d 
always, so one could start with a model (1) with q < d always, but it is convenient for 
the discussion in this paper to take q arbitrary. 




(1) 
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Our aim is to make some kind of inference on this Brownian dimension tb, which is 
also the maximal rank of Cg (up to a P(da;)ds-null set), on the basis of the observation 
of the variables Xjy/„ for i = 0,l,...,n, where [0, T] is the time interval on which the 
process is available. Let us make some preliminary comments, in which we refer to the 
"ideal" and "actual" observation schemes when one observes X completely over [0,T], 
or at times iT /n only, respectively. 

(1) Suppose we are in the pure diffusion case, that is, Cs ~ c(Xs), with c a continuous 
function, and that the range of the process is the whole of R** (i.e., every open 
subset of R'' is visited by X on the time interval [0, T] with a positive probability). 
Set r{x) := rank(c(a;)) and let A{uj) be the subset of R*^ which is visited by the 
path {X{uj)t:t G [0,r]). The Brownian dimension is = sup^g^^d r(a:), but, in 
the ideal scheme, we observe R{uj) = sup^g^j-^^^ r(x) and so we can only assert 
that r > R{uj). The situation is similar to what happens in the nonparametric 
estimation of the function c: in the ideal scheme, this function c is known on A{uj) 
and hopelessly unknown on 'EV^\A{uj). 

(2) More generally, the only relevant quantity we might hope to 'estimate' is the 
(random) maximal rank 

R{lli) = sup rank(cs(w)) (2) 

se[o,T) 

(we should take the essential supremum rather than the supremum, but the two 
agree since Cs is right-continuous in s). The variable R is integer- valued, so its 
'estimation' is more akin to testing that R = r for any particular r £ {0, . . . ,d}, 
although it will not be a test in the ordinary sense because R is random. Note that 
in many models, we will have that rank(cs(ti;)) is independent of s and lo; R is 
then non-random, but this property does not really makes the analysis any easier. 

(3) In the actual scheme, we will construct an integer-valued statistics i?„ which serves 
as an 'estimator' for R. We have to somehow maximize (and evaluate) the proba- 
bility that Rn = R, or perhaps this probability conditional on the value taken by 
R, or conditional on the whole path of X over [0, T] . That is, we perform a kind 
of 'conditional test'. 

(4) We might also take a different look at the problem. Considering the model (1), we 
can introduce a kind of 'distance' A,, between the true process X and the class 
of all processes X' of the same form, but with a diffusion coefficient c'^ satisfying 
identically rank(Cg) < r. We then construct estimators A" for A^, for all values of 
r, and decide on the basis of these A" which Brownian dimension is reasonable 
to consider for the model. The mathematical problem is then similar to the semi- 
parametric estimation of a parameter in the diffusion coefficient for a discretely 
observed diffusion with unknown drift: here, the 'parameter' is the collection of all 
Ar and the unknown (nuisance) parameters are the processes and Cs (or ag). 

The paper is organized as follows. In Section 2, we explain in a more precise way the 
'distances' mentioned above. Section 3 is a collection of simple linear algebra results and 
Section 4 contains the basic limiting results needed. In Sections 5 and 6, we then put the 
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previous results in use to develop some statistical applications and, finally, we provide 
some numerical experiments in Section 7. 

2. An instructive but non-effective approach 

In this section, we measure the discrepancy between the model (1) and models of the 
same type but with a different Brownian dimension. Wc denote by Sr the set of all cadlag 
adapted d x g-matrix-valued processes a' such that c'^ = ct^cTs* satisfies rank(cs) < r a.s. 
for all s. In particular. So contains only a' = 0. With any a' E Sr, we associate the process 

X't^Xo+ f a,ds+ f <j',dWs, (3) 
Jo Jo 

with the same a and the same W as in (1). 

A measure of the 'distance' between the two processes X and X' of (1) and (3), 
measured on the time interval [0,t], is the random variable A{X,X')t defined below. 
In the following formula, H ranges through all predictable d-dimensional processes with 
||^^t(w)|| < 1 for all {co,t), H* is the transpose, (M) is the quadratic variation process of 
the semimartigale M and • denotes stochastic integration: 

A(X,X')t= sup {H* •{X-X'))t. (4) 

H: \\H\\<1 

This measurement of the discrepancy between X and X' is particularly well suited to 
finance, where E(A(X, is a measure of the difference in the sense between the 

portfolio evaluations when one takes the model with X or the model with X' . Then set 

A(r;X)t =inf(A(X,X')t;X'is given by (3), with cr' e 5^) (5) 

for the 'distance' from X to the set of scmimartingales with Brownian dimension not 
more than r, again on the time interval [0,f]. 

Remark 1. Of course, A(X, X')t is not a genuine distance, for two reasons: it does not 
satisfy the triangle inequality (it is rather the square of a distance) and, more impor- 
tant, it is random. The genuine distance (which is one of Emery's distances ~ see [5]) is 
•\/E(A(X, X')t), provided we identify two processes which are a.s. equal, as usual. 

Also, note that the two approaches - here, where W is kept fixed, and in the previous 
section, where W may be changed into another Wiener process W - look different, but 
are actually the same. 

The next proposition shows how to compute 'explicitly' A(r; X)t. We denote by A(l)s > 
A(2)s > • • • > X{d)s > the eigenvalues of the matrix Cg and we set 



L{r)t^ f X{r)sds. 
Jo 



(6) 
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Proposition 2. For any r = 0, . . . , d — 1, we have A(r; X)t = L{r + 1)^ and the infimum 
in (5) is attained. 

Proof. It is no restriction to suppose that g > d (if not, we can always add independent 
components to W and, accordingly, components to a which are 0). Let J be the d y, q 
matrix with entry equal to 1 when 1 < i < d, and all other entries equal to 0. We 
can then find two cadlag adapted processes Hg and Qs, with values in the sets of d x d 

1 /2 

and q X (J orthogonal matrices, respectively, and such that as = lis As JQs^ where As is 
the diagonal matrix with entries X{i)s - Note that Cs = IIsAsII*. 

Let Ir (resp., I'^) be the d x d matrix with {i,i) entry equal to 1 when I <i <r 

1/2 

(resp., r-\-l<i<d) and all other entries equal to 0. We then set = Hs^s IrJQs and 

1 /2 

associate X' by (3). Then (jg — cr's~ IIsAs I'rJQs and thus 

{H* . (X ~ X'))t = [\H*IisK\I^IlJQsQiri:A\'^KHs) ds. 
Jo 

The integrand above is simply H*IisAl^^ I^aI^^II*Hs and, if \\Hs\\ =■ 1, we also have 
||n*i7sl| = 1 and thus this integrand is not bigger than A(r + l)s. Therefore, A(X, X')t < 
L(r + l)t. Furthermore, a'so'* = XIsAs/rll* is of rank < r, so o € . 

Now, let a' be any process in Sr and put c'^ = o'so'* . The kernel K'^ of the linear map 
on R'' associated with the matrix c'^, is of dimension at least d — r. The subspace Kg 
of R'' generated by all eigenvectors of this linear map, which are associated with the 
eigenvalues A(l)s, . . . , A(r + l)s, is of dimension at least r + 1 (it is strictly bigger than 
r + 1 if A(r + l)s = A(7' + 2)s). Kg n K'^ is then not reduced to {0} and we can thus find 
a process H = {Hs)s>o with ||-ffs|| — 1 and Hg G Kg n K'^ identically, and obviously this 
process can be chosen to be progressively measurable. Then c'gHg = (because Hg e K'^) 
and H*CgHs > X{r + l)s||-ffs|| = A(r + l)s (because Hg £ Kg). The first property above 
yields 

H*»X[= [ H*agds, 
Jo 

hence 

{H* . {X - X'))t - {H* . X)t - f{Htc*Hg) ds, 

Jo 

which, by the second property above, is not less than L{r + \)t. Hence, we are done. □ 

In particular, L{l)t = A(0; X)t measures the 'distance' between the process X and the 
pure drift process + /q ag ds. The following is obvious, with the convention L{d+ l)t = 
0: 



Rt<r ^ L{r + l)t = A{r;X)t=0, 



(7) 
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where, similarly to (2), we have set, for all t > 0, 

R{uj)t = sup rank(cs(tj)). (8) 

Hence, the random value L(r + l)j measures the distance between X and the set of all 
processes with Brownian dimension r, over the time interval [0,t], and for our particular 
path of X. Note also that it is an 'absolute' measure of the distance, which is multiplied 
by if wc multiply the process X by u. 

Unfortunately, the variables L{r)t do not seem to be easy to 'estimate' from discrete 
observations since they involve eigenvalues. Hence, we will construct estimators below 
which are easier to handle. 



3. Linear algebra preliminaries 

Consider for a moment the toy model X = aW, where cr is a non-random d x q ma- 
trix. That is, we have (1) with Xq ~ 0, as = and (Tg^to) ~ a, or, cquivalcntly, X is a 
Wiener process with covariance St at time t, where S = aa* . The observation scheme 
amounts to observing n i.i.d. random vectors Gi, all of them A/"(0, E)-distributed (viz. 
Gi = Vn/TA'^X, with the notation A'^X = X,T/n - X{i-i)T/n)- To infer rank(E) from 
the observations of the n first variables Gfc, wc can use the empirical covariance 

1 " 

^"--E^-feG^ (9) 
fc=i 

Indeed, the variables Gk have a density over their support, which is a linear subspace 
with dimension rank(S). Hence, rank(S„) is almost surely equal to n when n < rank(S), 
and to rank(I]) otherwise, and the problem is solved in a trivial way. 

If we are in the same setting, except where depends on s (and is still deterministic) 
and has a constant rank r, then, typically, the eigenspaces 'rotate' when s varies, and 
the rank of I]„ above is a.s. equal to d as soon as n > d. Therefore, E„ gives no insight 
on the rank. So, the problem for non-homogeneous Wiener processes and, a fortiori, for 
general diffusions like (1), is actually more complex. 

Despite the uselessness of the toy model consisting of an homogeneous Wiener process, 
let us give a couple of formulas relating to it, for further reference. We denote by Ar the 
family of all subsets of {1, . . . , d} with r elements {r = 1, . . . ,d).lf K G Ar and S = (E'^) € 
Md, we denote by detA'(E) the determinant of the r x r submatrix (E*'' -.k^lE K) and 
let 

det(r;E)= ^ det(E). (10) 



Observe that det(d;E) = det(E), while det(l;E) is the trace of E. 
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Lemma 3. IfTjCz Add has eigenvalues A(l) > • • • > X{d) > 0, we have, for r = 1, . . . ,d, 
1 



d{d-l)---{d-r + l) 



det(r; E) < A(1)A(2) • • • A(r) < dct(r; E). (11) 



Notice that both inequaUties in (11) may be equaUtics. It foUows from (11) that, with 
the convention 0/0 = 0, we have 

l<r<d => (^^'■"tS ^ hIS'^'S'^o (12) 
- - [r>rank(E) =^ det(r;i;) = 0, ^ ^ 

rl det(r;E) , dl det(r;S) , , 

^ Z det(.-l;E) ^^^-)^ (.-l)!det(.-l;E) - ^''^ 

Proof of Lemma 3. We expand the characteristic polynomial of S as 

d 

det(E - XI) = (-A)'^ + ^(-A)'^-'' det(r; S). 

r=l 

In view of the well-known expressions for the 'symmetrical functions' of the roots of a 
polynomial, we get 



J2 A(^l)A(^2)•••A(v)=det(r;S) 



l<ii<-<ir<d 

and thus both sides of (11) are obvious. □ 

Next, we consider a sequence {Gi)i>i of i.i.d. A/'(0, S)-distributed random vectors. For 
all j = 1, . . . , d, we define two random elements of A^d by 

and we consider the mean and covariance of the random vector (det (r ; )/r ! : 1 < r < d) : 

(15) 



7(r;E) = iE(det(r;C.)), 
r(r, r'; S) = -i-E(det(r; C) det(r'; CO) " l{r; ^h{r'; S). 



rlr'l 

Since {CjXj : 1 < j < c?) are i.i.d., we also have 



r(r, r'; S) = -i-E(det(r; C.) det(r'; Cr') - det(r; C) det(r'; CO)- (16) 
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Lemma 4. // r S {1, . . . , d}, we have 

7(r;S])=det(r;S). (17) 

Moreover, 

r<rank(S) =^ 7(r; S) > 0, r(r, r; S) > 0, 1 
r>rank(S) =^ 7(r; S) = 0, r(r, r; E) = 0. J 

Proof. To prove (17), it is enough to show that for any K E Ar, we have E(deti<-(Cr)) = 
r!det;f (E), and for this, it is no restriction to assume that K ~ {1, . . . ,r}. We denote 
by Vr the set of aU permutations of the set {1, . . . ,r} and by £(t) the signature of the 
permutation r. Then 



det(c.)= E(-ir^^^nc^^'= E E(-i)^^^^nGK' 



TGVr 1 = 1 l<ki,...,kr<rT£'Pr 1=1 

and each summand of the first sum of the extreme right-hand side is the determinant of 
a matrix with rank less than r, unless all ki are distinct. So, it is enough to sum over 
all r-tuples (fei, . . . , fc^) with distinct entries between 1 and r, that is, for r-tuplcs with 
hi = t' {i) for some t' CzVr. In other words, we have 



d|t(c.)= E E(-i)^^*^'nG^'(oG;%- (19) 

T'eVrTeVr 1=1 
Since the variables G„ are independent and E(G'^G'Jj) = E*^', we deduce that 

r 

E(det(C.)) = H E (-l)^(^'n^'''" =^!dct(E) 

reVr 1=1 

and we have (17). 

If r > rank(E), we have det(r;E) = (sec (12)): the non-negative variable det(r;Cr) 
has zero expectation, so it is a.s., null and we have the second part of (18). Finally, let 
r < rank(E). Again, by (12), we have E(det(r; C,r)) > 0. Also, observe that det(r; (r) is a 
continuous function of the random vectors Gn for n — I, . . . ,r which vanishes if all these 
Gn are 0. Thus, det(r;(^r) can take arbitrarily small values with positive probability and 
it has a positive expectation, so it is not degenerate and we get the first part of (18). □ 

4. Limit theorems for estimators of the Brownian 
dimension 



It turns out that determinants or 'integrated determinants' arc much easier to estimate 
than eigenvalues or integrated eigenvalues. So, in view of (11) and (12), one might replace 



Estimation of the Brownian dimension 477 
the variable L{r)t of (6) by 



L'ir) 



I det(l;cs)ds, 
* det(r;cs) 



i 



det(r — 1; 



if r = l, 
ds, if r > 2. 



However, L'{r)t for r > 2 is still not so easy to estimate. For example, for the toy model 
of Section 3, the variable (lei{r]C,r)/r\ is an unbiased estimator of det(r;S) (see (17)), 
but we have no explicit unbiased estimator for a quotient like det(r; E)/dct(r — 1;S). 

So, we propose to measure the distance between X and the set of models with multi- 
plicity r, over the time interval [0,t], by the following random variable: 

L{r)t= f det(r;c,)ds. (20) 
Jo 

Up to multiplicative constants, this more or less amounts to replacing the 'natural' dis- 
tance L{r)t by X{l)s ■ ■ ■ ^(r)^ ds. The variables L{r)t, L'{r)t and L{r)t convey essen- 
tially the same information as far as rank is concerned and, in particular, they vanish 
simultaneously, which is the most important property for our purposes. In other words, 
exactly as in (7), we have 

Rt<r ^ l(r + l)t = 0. (21) 

By virtue of (17), we can rewrite L{r) as follows (wc also introduce additional variables 
Z{r,r'), using the notation (15)): 

L{r)t = / -fir; c,) ds, Z{r, r')t - / T{r, r'; c,) ds. (22) 



Now, we need to approximate the variables in (22) by variables which depend on our 
discrete observations only. To this end, we introduce the random matrices 

r 

C(r)^=^(Ar+,_lX)(Ar+,_lX)^ where ArX = X,T/„-X(,_i)T/„. (23) 

We have C{r)2 G Aid and rank(C(r)") < r. We then set (with [x] being the integer part 
of x) 

_ r-l [nt/T]-r+l 

= E dct(r;C(r)r) (24) 

and 

r+r'-l [■nt/T]-d-r' + l 

Z(^y)t = E (det(r;C(r)r)det(r';C(r')r) 
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(25) 

-det(r;C(r)r)dct(r';C(r')S+,))- 
The first key theorem concerns the 'consistency' of these variables. 

Theorem 5. Under (H), the variables L{ry^ and Z(r,r'y^ converge in probability to 
L{r)t and Z{r,r')t, respectively, uniformly in t ^ [^^T]. 

This is not enough for our purposes and we need rates of convergence. For this, (H) is 
not sufficient and some additional regularity on the coefficients a and cr is necessary. A 
first set of sufficient conditions is simple enough. 

Hypothesis (HI ). We have (H) with a cadlag process a and a process a which is Holder 
continuous (in time) with index p > 1/2, in the sense that 

0<s<t<T {t~S)P 

The above assumption on a is quite mild and the assumption on cr is reasonable when 
a is deterministic. However, in the pure diffusion case, we have cts = cr(Xs) for, say, a 
Lipschitz or locally Lipschitz function a and, of course, (26) fails for any p > 1/2. This 
assumption also fails when as is a 'stochastic volatility' driven by an Ito equation and 
even more if this equation has jumps! 

Therefore, for practical purposes which are especially relevant in finance, we need to 
replace (HI) by a different assumption. This assumption looks (is ?) complicated to 
state, but it essentially says that a is as in (HI) and that the process cr follows a jump- 
diffusion Ito equation, or, in other words, that it is driven by a Wiener process and a 
Poisson random measure). In particular, it is satisfied in the pure diffusion case when 
as = a{Xs)j with a function a. 

Hypothesis (H2). We have (H), the process a is cadlag and the process c is a (possibly 
discontinuous) Ito scmimartingale on [0,T], that is, for i < T, we have 

at^ao+ f «; d.s + / cr^„ dWs 

t t (27) 

+ / / (f o w{s— ,x){ij — u){ds,dx) + / {w — tp o w){s—,x)fi{ds,dx). 
Jo J E J a J E 

Here, a' is R'' ® R'' ® R'^-valued adapted cadlag and a' is R'' ® R'^-valucd predictable 
and locally bounded; p, is a, Poisson random measure on (0,oo) x independent of 
W and V, with intensity measure v{dt,dx) = dtF{dx) with F a cr-finite measure on 
some Polish space {E,£); ip is a continuous function on R'''' with compact support, 
which coincides with the identity on a neighborhood of 0; finally, w{uj,s,x) is a map 
f2 X [0, oo) X E IV^ (E) R"? which is J-g ^ ^-measurable in (oj, x) for all s, cadlag in s and 
such that /g(l A sup^^^ ||u;(a;, s, x)\\'^)F{dx) < oo. 
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These conditions are indeed quite easy to check in practice. They accommodate the 
case of a stochastic volatihty driven by a Wiener process having some (or all) components 
independent of X: since W has an 'arbitrary' dimension q in this paper, possibly q> d, 
there might be components used for X in (1) and other components used in (27). 

Theorem 6. Assume either (HI) or (H2). The d- dimensional processes (V^(?')")i<r<d 
with components 

V(r)^^^i{L{r)^-L{r)t) (28) 

converge stably in law to a limiting process {V(r)t)i<r<d which is defined on an extension 
of the original space and which, conditionally on T , is a non-homogeneous Wiener process 
with quadratic variation process < i— > (TZ(r,r')t)i<r,r'<d- 

Proof of Theorems 5 and 6. The proof involves several steps. 

(1) It is based on the following two results of [4]. Take N functions gj on R'' which 
are and with polynomial growth and even. Set 

[nt/T]-N+l N 

i=i k=i 

Then, under (H), we have 

Yigi,...,gN)t -^Yi9i,---,9N)t-^ / yigi, . . . ,gN;cs) ds, 

Jo 

where the convergence in uniform in t G [0, T] and where y{gi, . . . , gN] S) is, for any dx d 
covariance matrix E, the expectation of the variable 

N 

7(Si,---,57v) = J|fffc(Gfc) 

k=l 

and the G,i's are i.i.d. random vectors with law A/'(0,E), as in Section 3. 

If, further, (H2) holds, then for any array {{g^, . . . , gj^ ,) : 1 < j < J) with gf as above, the 

J-dimensional processes {^/n/T{Y{g{, . . . ,g'J^.)" — Y{gl, . . . ,glf,)t)i<j<j converge stably 
in law to a limiting process which, conditionally on T, is a non-homogeneous Wiener 
process with quadratic variation process /p r(cs)ds and where r(I]) is the covariance 
matrix of the random vector {^{gl, ■ . . ,5Ar.))i<i<Ji as defined above. Under (HI) instead 
of (H2), the same result holds: it is not explicitly stated in [4], but the proof is similar, 
and technically much simpler. 

(2) These results extend by 'linearity' in an obvious way. More precisely, for 1< j < J, 
set 

[nt/T]-N, + l 

y{3)t=- E /i,(v^Afx,y;vTAr+iX,...,v/;v7'Ar+^^._iX), (29) 

I— 1 
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where each hj is a hnear combination of tensor products gi® ■ ■ • ^ Qn- , where the g^'s are 
functions on R**, even and with polynomial growth. Also, denote by M(S) and C(T,) 
the mean vector and the covariance matrix, respectively, of the J-dimensional random 
vector {hj(Gi, . . . ,GNj))i<j<j , with Gi as above. Then: 

1. under (H), we have 

Y{j)t ^ Y{j)t := / M\cs) ds, uniformly in t G [0,T]; (30) 



2. under (HI) or (H2), the J-dimensional processes with components y^n/T{Y{j)'^ — 
Y{j)t) converge stably in law to a limiting process which, conditionally on JF, is a 
non-homogeneous Wiener process with quadratic variation process /p C{cs) ds. 

(3) The theorem is now almost trivial. The determinants entering (24) (resp., (25)) are 
sums of even monomials of the components of Af^j_-^X for 1 < j < 2d, each one with 
degree 2r (resp., 2(r + r')). More specifically, L{r)'^ is of type (29), with Nr^r and the 
function 



^det |^r,^x,4^. 



whereas Z{r,r')^ is of type (29), with Nry =d~\-r' and the function 



Jti (^dct l^r, ^J^)^ det |^r', ^ Xjx)^ 

/ r \ / d+r' \\ 

— det r, > Xoxt dot r', > XoX^ . 

Theorem 5 then follows readily from step 2 and the relations (15). □ 

In the sequel, we will also need some estimates on the moments of V{r)", uniform in 
n. It follows from the proofs in [4] that, under (H2) and for each t E (0,r], there is a 
sequence Ap^t of -measurable sets such that 



Ap n as p ^ oo , 

P>1, n>l, te(0,T] =^ i:{\VmHA^,)<Cp 



(31) 



for a suitable sequence of constants Cp (depending on T). The same result also holds 
under (HI). 

Now, if the coefficient a is bounded and if we have (HI) with (26) holding uniformly 
in uj, or (H2) with a'g{Lu), a'g — cu bounded and ||i(;(w,s,x)|| < h{x) for some function 
having J^{1 /\h{x)^)F{dx) < oo, one can (easily) prove that (31) holds for Ai^t = ^, and 
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so there is a constant C, depending on T, such that 

n>l,te{0,T] =^ E(|y(r)J'n<C. (32) 

5. Tests based on thresholds 

5.1. A test based on an absolute threshold 

We return to the initial problem, in the light of the second comment of Section 1. Namely, 
we want to decide which integer value (between and d) the variable R of (2) takes for 
the particular path lo which is known only through the observations Xix/n- In principle, 
we have our observations Xix/n for i = 0, . . . ,n, but it may be interesting to determine 
how our estimators behave as time changes. This is why we also give estimators for the 
variable Rt of (8), based on the observation of Xix/n for i = 0, ■ ■ ■ ,[nt/T]. 

Let us reiterate that in the ideal scheme (the whole path of X is known over [0, T]), we 
also know R = R{w), whereas we have the equivalence (21). In view of this, and taking 
into account the convergence result in Theorem 5, it seems natural to operate as follows. 
We choose a sequence of positive numbers p„ such that 

Pn^O, p„^/n^oo. (33) 

We then take the following 'estimator' for Rt'. 

Rn.t = inf (r e {0, . . . , d - 1} : L(r + 1)^* < p^t), (34) 

with inf (0) = d. That this estimator is a priori reasonable is due to the fact that if we set 
Rn^t = inf(r e {0, . . . , d — 1} : L{r + l)t < Pnt), then by (21) and the property Pn — > 0, we 
have P{Rnj = Rt) ^1 as n ^ 00. We take a threshold of the form p„t because L{r + 1)" 
is roughly proportional to t. 

Remark 7. Another equally reasonable estimator, which is a kind of 'dual' of i?„^t, is 
the following one: 

K^t = sup(r e {1, . . . , d} :r(r)r > Pnt), (35) 
with sup(0) = 0. The analysis of Rn t below carries over for i?^ ^ in a very similar way. 

Remark 8. The choice of the threshold p„ is arbitrary, upon the fact that (33) holds; 
asymptotically all choices are equivalent. In practice, though, it is of primary importance 
because n, albeit large, is given and is of course not infinite! Even worse, an absolute 
threshold like in (34) is sensitive to the unit in which the values of the XI are expressed. 
For example, if we multiply all components by the same (known) constant, the estimator 
of the Brownian dimension provides a different value. So, using an absolute threshold 
is probably not advisable in general. Nevertheless, we pursue here the analysis of tests 
based on an absolute threshold since they may serve as a case study and arc somewhat 
simpler to study than the tests based on relative thresholds which are introduced later. 
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The integer-valued estimator Rn,t should be analyzed using the testing methodology 
rather than as a usual estimator: we test the hypothesis Rt = r with the critical region 
{Rn,t 7^ r}. The 'power function' is, in principle, the probability of rejection, a function 
of the underlying probability measure. Here, we have a single P and Rt is (possibly) 
random. We thus develop two different substitutes for the power function. 

5.2. A first substitute for the power function 

A seemingly acceptable version of the power function is 

Pl,{r') = P(R„^t^r\Rt=r'), r' = 0, 1, . . . , d, (36) 

provided P(i?t = r') > 0. We explicitly mention the number n of observations and the 
number r, but it also depends on the sequence p„. The index r indicates the 'test' with 
null hypothesis Rt ~ r which we arc performing, while the index r' indicates the 'true' 
value or Rt- So, I3n t{r) should be small and Pn^ti"""') should be close to 1 when r' ^ r. 
We have a first - quite simple - result. 

Theorem 9. Under (33) and either (HI) or (H2), we have, for all r,r' in {!,..., d}, 
and provided P{Rt ~r') > 0, 

Another equivalent (simpler) way of stating this result is to write 

P(i?„,t ^ Rt) ^ 0. (38) 

This is more intuitive, but somehow further away from the way results on tests are usually 
stated. 

Proof of Theorem 9. For each s = 1, . . . , (r' + 1) A d, we set (5^_t(r') = P(I(s)J' < 
PrnRt = r'). Observe that, if p'.^ = Pn\pn, and with the notation (28), 

51,^) < P(L{s)t < 2p^t,Rt = r') + P{\V{s)^\ > p'J), 
P{Rt = r') - 6Ur') < P{L{s)t > p„t/2, Rt = r') + P{\Vis)^\ > p'J/2). 

Theorem 6 yields that the sequence V{s)" converges in law when n goes to infinity, 
whereas on the set {Rt = r'}, we have L{s)t > if s < r', and L{s)t = if s > Therefore, 
it follows from (33) that 
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Now, {Rn,t ^ r'} = (Ui<s<r'{^(s)r < Pnt}) U {Lij' + 1)^* > p„i}, with the convention 
that {L{d + l)r > Pnt] = 0. Then 



Y^SUr') + P(i?, = /) - <r (r'), if r' < d - 1, 

r' 



It then readily fohows from (40) that ^{r') ^ as soon as P(i?t = r') > 0. Under this 
assumption, and if r 7^ r', we clearly have j(r') = 1 — P(i?„^i = r | i?t = r') > 1 — ^(r), 
hence^;,4(r')-^l. □ 

The previous result seems to settle the matter. However, it is not as nice as it may 
look, because it gives no 'rate' for the convergence in (37) or (38) and is thus impossible 
to use in practice. The impossibility of getting a rate is apparent in (39): the second 
terms on the right may be more or less controlled through estimates like (31), but the 
first terms on the right cannot be controlled at all; indeed, if Rt = r, the variable L(r)t 
is positive, but may be arbitrarily close to 0. 



5.3. A second substitute for the power function 

As emphasized in Comment 4 of Section 1, one may reasonably decide that i?t = r if r is 
the 'true' Brownian dimension in a 'significant' way, which means, in particular, that the 
'distance' between the model X and the set of models with Brownian dimension r' < r 
is not 'infinitesimal'. This may be interpreted as the property that L{r)t exceeds some 
positive level for all r < Rt. 

In other words, we set Br'^e^t — {Rt — t' ,L{r)t > et for ?■ = 1, . . . ,r'} and define the 
'power function' as being 

p:4r\e)^P{Rn,t^r\Br',e,t), (41) 

provided P{Br',e,t) > 0. 

Evaluating /3^((r',e) is still difficult, because it involves the unknown quantity 
P(i3r',e,t)- So, we provide a result which does not directly give the power function itself, 
but which is probably more relevant for applications. 

Theorem 10. Under (HI) or (H2), there are Tt -measurable sets [Ap^t] increasing to fl 
as p ^ 00 and constants Cp such that, for all r in {1, ... ,d}, and provided pn < e/2, 

F{{Rn,t^r}nBr,e,tnAp^t)<-% (42) 
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for all t € (0,T]. //, further, (32) holds, we can find a constant such that 

P({^„,t ^r}r^ Br,e,t) < (43) 

or, in other words, the 'level' satisfies (3'^ ti'''^^) — C'/('^P^P(^r,e,t))- 

Note that we can choose p„ above at will, provided it satisfies (33), and none of Ap, 
Br',e,t, Cp depend on this choice (the estimator R„^t does, though). So, we can obtain a 
rate for any 6 € (0, 1), as close to 1 as one wishes. 

Proof of Theorem 10. We consider the sets Ap^t for which (31) holds and denote by 
Cp the constants occurring in that formula. For s = 1, . . . , (r + 1) A d, we set (e > being 

fixed) S^ i{r,p) = P({Z(s)J' < p„t} n Br^ej n Ap^t)- Exactly as for (39), we have 

dUr,P) < P({i(s)t < 2p„t} n Br,e,t) + Pi{\Vis)^\ > p'j} n Ap^t), 
F{Br,e,t n Ap,t) - 5',tAr,P) < Pi{L{s)t > Pnt/2} n Br,e.t) + P({|T^(s)ri > Pntm H Ap,t). 

Taking into account (31), pn < s/2 and the facts that L{s)t — OiiRt~r<s and that 
L{r)t > et on B{r,e,t), we deduce from Cliebyshev's inequality that 

C 4C" 

s<r =^ 5l^{r,p)<^, p(ij nAp,t)-^;y(r,p)<^, (44) 

Pn Pn 

where the second equality makes sense when r < d only. Once more applying the identity 
{i?„,t ^ r} = {L{r + l)r > p„t} U (Ui<s<. W'S)? < Pnt}, we get 



P({i?„,t^r}nS,,,,tnAp,t)< < 



Y,5l^{r,p) + nBr,e,t n Ap.,) ~ 5l+\r,p), if r < d, 

r 



We then deduce (42) from (44) if we put Cp = {4 + d)C'p. Finally, under (32), we may 
choose Ai_t = ri above and thus (43) with C = (4 + d)C[. □ 

Of course, (42) is not useful in general, although it gives us a rate, because we do not 
know the sets Ap^f If (32) holds, the result appears much more satisfactory; however, we 
still do not know the constant C in (43) and have no mean to guess what it is from the 
observations. 



5.4. Tests based on a relative threshold 



In practice, the previous tests are not recommended; see Remark 8. We now exhibit other 
tests which are scale-invariant. 
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If we multiply X by a constant (5 > 0, then Cg is multiplied by 5"^ and both L{r)'^ 
and L(r)t are multiplied by (5^''. Then, for any given sequence p„ € (0, 1] satisfying (33), 
the following two 'estimators' of i?, which are candidates to be explicative Brownian 
dimensions^ are scale-invariant: 

Rn.t - inf (r e {0 . . . , d - 1} :I(r + 1)^' < p„i-iAjI(r)«)(-+i)/'-), 1 
K.,=inf(re{0,...,rf-l}:L(r + l)r<p„t-'-(L(l)rr+i), j 

with the convention that i(0)" = 1 and, again, inf(0) = d. The presence of t^/"" or f 
above accounts for the fact that L{r)" is roughly proportional to t, as in (34). 

Note that R'^^ * > li even when Rt = 0. So, if i?t = 0, this estimator is bad, but. in 
this case, our problem is essentially meaningless anyway! When -Rt > 1, the significance 
of these two estimators is essentially as follows: i?„_t is the smallest integer r for which 
there is a 'large' drop between the explicative powers of the models with Brownian 
dimensions r and r + 1, whereas i?^ ^ is the smallest integer r at which the ratio between 
the contributions of the (r + l)th and the first Brownian dimension is smaller than /5„. 
Clearly, there exist other estimators of the same kind, with slightly different meanings, 
the above two being the extremes. All such estimators are amenable to essentially the 
same mathematical analysis. 

In practice, the choice of p„ is relative to the physical phenomenon under consider- 
ation and to the use which is made of the model (prediction, simulation, computation 
of extreme values, etc.). Roughly speaking, the choice should reficct the physical effects 
which arc modelled as the driving noise and the intensity of components of the noise 
which are considered important to capture essential properties of the model. 

Here, again, the substitutes for the power functions are 

^;,(r') = P(i?„,t^r|i?t=r'), P'l,{r') ^P{Rl^^, ^ r \ Rt ^ r'). (46) 
We now aim to produce a result similar to Theorem 9. 

Theorem 11. Under (33) and either (HI) or (H2), we have, for aUr,r' in {!,..., d}, 
and provided P{Rt =r') > 0. 

Pn,t[r ) tfr^r' ^ ^ 

and the same for /3^''j(r'). 

Proof. We prove the result for /3^t(r') only, the other case being similar. For each 
s = 1, . . . ,r' A d, we set 

<5:,,(r') = P(I(s)r < Pnt-'/^Lis - l)-f^''-'\Rt = r'). 
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As in Theorem 9, we write p'j = Pn\pn^ and with the convention 0/0=1 and t e (0,T] 
fixed, we put 

Observe that, similarly to (39), we have 

5^,,(r') < P(L{s)t < 2p,,t-^/%L{s - l)tf-'^^%Rt = r') + P{\V{sU > p'^, Rt = /), 

V{Rt = r') - Sl^ir') < F{L{s)t > Pnt-^''{L{s ~ l)tt-^^'' 12, Rt = r') 
+ V{\V{sU>p'r,/2,Rt = r'). 

Moreover, Theorem 6 yields that on the set {L{s — l)t > 0} = {Rt > s — l}, the variables 
V{s)n converge stably in law to the variable 

~ ^ _ tV-I(s)t f V{s)t _ s V_{s-l)t \ 
^' (L(s - 1)0^/(^-1) U(s)t s-lL(s-l)J- 

Therefore, since (33) holds, we get (40). At this stage, we can reproduce the end of the 
proof of Theorem 9 to obtain (47). □ 

Obviously, the comments made after Theorem 9 for the estimators Rn,t apply to Rn,t 
or R'^ t and, in particular, the fact that (47) gives no rates. Moreover, there is nothing like 
Theorem 10 here because we have no moment estimates like (31) or (32) for the variables 
V{s)n of (48) (such estimates seem to be out of reach because of the denominators). 

Remark 12. One could also think of estimators similar to (35), for example, 

= sup(r e {1, . . . , d}:L{r)- > p^t-''^^-^\L{r - lyiY'^'-'^). (49) 

However, the previous analysis does not carry over to i?^^, again because we do not 

know whether the variables V{s)^ converge stably in law on the set {Rt < s — 1} (once 
more due to the presence of the denominators). 

Remark 13. Here, again, the problem of choosing the threshold p„ is crucial in practice, 
despite the fact that a relative threshold is - at least - insensitive to the scale. This is 
somehow illustrated by the numerical experiments conducted in Section 7. For real data, 
only the experience of the statistician, at this point, can help for the choice of pn- We 
hope, in a future work, to be able to derive some tests, again based on the consideration 
of the variables L{r)" , but which do not necessitate an arbitrary threshold. The next 
section is something of a first, and somewhat incomplete, attempt in this direction. 
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6. A test based on confidence intervals 

Finally, wc can take full advantage of the fourth comment in the Introduction. Namely, 
instead of trying to directly evaluate we can try to evaluate the variables L{s)t for 
all s = 1, . . . , d. 

In view of Theorem 6, this is quite simple. In restriction to the set {Rt > r}, the vari- 
ables y/n{L{r)^ — L(r)t) are asymptotically mixed normal, with a (conditional) variance 
TZ{r,r)t which, in turn, can be estimated by TZ{r,r)" because of Theorem 5. And on 
the set {Rt < r}, the variables ■\/n(L(r)" — L{r)t) = ^/nL{r)^ go to in law. 

This allows one to derive (asymptotic) confidence intervals for L{s)t. More precisely, 
we get 

\iinP{WnTZ{r,r)f(L{r)^-Lir)t)\ >j\Rt = r')^ P{\G\ > 7) (50) 

n 

for any 7 > 0, where G is an Af{0, 1) random variable, provided P(i?t = r') > and r' > r. 
This is quite satisfactory because Z{r, r)" is observable. On the other hand, as soon as 
P{Rt = r') > 0, r' < r and 7 > 0, we get 

limP(|V7rL(r)^| >7 I i?f =/) =0. (51) 

n 

This is less satisfactory because the confidence intervals based on this are not sharp. It 
is also difficult to obtain non-trivial limit theorems for the sequence L(r)", suitably nor- 
malized, when Rt <r. It seems linked to the speed with which the eigenspaces associated 
with the positive eigenvalues of Cg rotate in R"*. 

An example. Let us consider, for instance, the problem of 'testing' whether the scale- 
invariant variable St := t^~^L{r)t/{L{l)tY exceeds some prescribed level e > (with, 
say, 0/0 = 0). This variable is naturally estimated by S„^t = r^^I(r)^/(I(l)J*)''. 

Although, once again, this is not a testing problem in the usual sense, one can proceed 
as if St were a (deterministic) parameter and the null hypothesis is S't > e. Critical regions 
on which we reject this hypothesis are naturally of the form 

Cn,tiv) = {Sn,t<v}- (52) 

The 'level' of this test is 

alt =supPiC,,,t{v)\ St >x) (53) 

x'>e 

and its 'power function' for x € (0,e) is 

(3ltix)^P{CnAri)\St<x) (54) 

(it would perhaps be more suitable to use P(C„(77) | S* = a;) as the power function, but 
the better cannot be evaluated properly below). We also need the variable 

Z _j,^2ir-i Mrri? ( Z{r,r)^ ^rZi^r)^ r^Z(l,l)n 
L{1)1 \{L{r)\r L{r)fL{l)- (L(l)r)V' 
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which looks comphcated, but is actually computable at stage n from our observations. 
We then get the following result. 

Theorem 14. Assume (HI) or (H2). Let a£ (0,1) and take 7G R to be such that 
P(G > 7) = a, where G is an A/"(0, 1) variable. 

(i) IfP{St > £) > 0, the 'tests' with critical regions Cn,t{rin,t) have an asymptotical 
level less than or equal to a (i.e., limsup^a^"^' <a) if we take 



\/\Zn.t\ , . 

Vn.t = £ - 7 r- ■ (^6) 

(ii) //P(L(l)t > 0) = 1, the power function /J^"/ of the above test, with r]n,t given by 
(56), satisfies {x) 1 for any x € (0, e). 

The assumption P{L{l)t > 0) = 1 in (ii) is very mild: it rules out the case where the 
function s > Cs{uj) vanishes on [0,t] on a subset of Q with positive probability. If it fails, 
then the variable St is not well defined on this set anyway and the problem is essentially 
meaningless. 

Proof of Theorem 14. The result is based on the following consequences of Theorems 
5 and 6. We fix i g (0, T] and introduce the variables 

Vn:=V^{Snt-St)^t--'V^( 3''^' - 3"^^' ]. (57) 

^ ^ ■ ' ^ Kimir {L{i)tY) 

Theorem 6 yields that, in restriction to the set A — {L{l)t > 0}, the variables Vn converge 
stably in law to the variable 



iL{l)tr\L{r)t L{l)t 

Conditionally on the a-field and again in restriction to A, the variable V is centered 
normal with variance 

L(l)2'-V(L(r),)2 L(r),L(l), (^(l),)^ 

Finally, the above variable Z is the limit in probability of the sequence Z„ defined in (55), 
by virtue of Theorem 5. To summarize, we deduce that the variables T„ = Vn/ \/\Zn,t\ 
converge stably in law, again in restriction to A, to an A/'(0, 1) variable, say G, which is 
independent of T . In particular, for all y G R, we have 

Be^,Bc^ =^ P({r„<2/}nB)^P(B)P(G<2/). (58) 
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It remains to observe that Sn,t = St-\- Zn,t/n. With the choice (56) for 77„,t, we 
then have T„ < —7 on Cn,t{'r]n,t) as soon as St > e, and (i) follows from (58) applied to 
B ^ {St> s}, which is included into A. Finally, the assumption in (ii) is that P(^) = 1. 
Let y,z > and x < e, and observe that if T„ < y, S < x < e and ^y\Zn.t\ < z^/n, we 
have Sn < X + yz and so we are in Cn,t{'nn,t) as soon as yz < e — x. Then (58) with 
B ^ {S < x} and the fact that Z„^t converges in probability to Z yield, for any ?/, z > 
with yz < e — X, 



Since y is arbitrarily large, we conclude that P(C„_t (??«,*) H B) ^ P(^) and (ii) follows. □ 

7. Numerical experiments 

In this section, we present numerical results for three different families of models. The first 
two concern financial applications, namely, the calibration of baskets of standard stock 
prices, or energy indices, with stochastic volatilities. Our last example is not motivated 
by finance; however, it presents a kind of degeneracy which illustrates an (unsurprising) 
limitation of our estimation procedure. 

All numerical results below concern the test based on a relative threshold described in 
Section 5.4. 

7.1. Models with stochastic volatihties 

In finance, the calibration of models is a difficult issue. One has to handle missing data 
in statistical analysis; the frequency of price observations is often too weak to allow one 
to estimate quadratic variations, and thus volatilities, with good accuracies, and if it is 
high, then microstructure noises tend to blur the picture; moreover, because of market 
instabilities, any particular model with fixed parameters or coefficients can pretend to 
describe market prices over only short periods of time. Consequently, the practitioners are 
used to calibrating implicit parameters of their stock price models by solving PDE inverse 
problems (see, e.g., Achdou and Pironneau [1] and the references therein), minimizing 
entropies (see, e.g., Avellaneda et al. [2]), etc. Such procedures use instantaneous market 
information on the stocks under consideration, particularly derivative prices, rather than 
historical data. In all these approaches, as soon as one deals with a portfolio with several 
assets, the Brownian dimension is a parameter of prime importance. We have thus studied 
the performances of our estimation procedure within the commonly used Black, Scholes 
and Samuelson framework with stochastic volatilities. 



P(C„,t(7?„,t) n B) > P({T„ <y}C^B)- P(Z„,t > nz2) ^ P(B)P(G' < y). 



Consider 
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with (Ti = 0.1, (72 = 0.2, ri = 0.05 and = 0.15. 

To simulate paths of {X^,Xf), we have used the Euler scheme with stepsize 10""*. The 
final time is T = 10. The observations are at times k ■ IQ-'^, 1 < fc < 1000. In view of (45), 
we consider the estimators 



L"(l)2' L,(l)2 

Figures 1-4 are organized as follows: the left picture, displays a particular sample path 
of the pair {X} ^X"^)^ and in the right picture, we have plotted the paths of (1) (solid 
line) and ^^(1) (dashed line) corresponding to the path of the left display. Moreover, at 
each integer time t = 2, 3, . . . , 10, the right picture also displays two boxes and whiskers: 
the box and whiskers on the right plots the empirical quartiles and extends upward and 
downward to the extremal values of 500 independent samples of the random variables 

(1); the box and whiskers on the left provides similar information on ^^(1). Moreover, 
the left-hand paths in all Figures 1-4 correspond to the same simulated path of the 
Brownian motion {B^,B^). 

In this example, we see that the paths of {X^,X'^) for different values of p (and 
corresponding to the same Brownian motion {B^ , S^)) arc difficult to distinguish, whereas 
the values taken by (1) clearly allow one to distinguish the strongly correlated and 
weakly correlated cases. 

Figures 5 and 6 display the same box and whiskers pictures as previously, but with 
p = 0.00 and p = 0.99, and for two sampling frequencies T/n: for each integer time t, 
the left box and whiskers are the same on the left and right displays (they are both for 
^t(l) - beware the change of scale between the two displays), but, unsurprisingly, the 
spread of (1) is bigger at low frequency (right display). However, even at the lowest 
frequency (with only 100 observations), it allows correct estimation of the real Brownian 
dimension. 
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Figure 2. Test case 7.1: p = 0.5. 




Figure 4. Test case 7.1: p = 0.99. 
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7.2. A simplified model for energy indices 

We now present a toy model for oil prices. In his Ph.D. thesis (within a collaboration 
between INRIA and Gaz do France), O. Bardou [3] has studied modelling and simula- 
tion questions related to energy contract pricing problems. One question was to identify 
the coefficients of a stochastic differential system which could satisfyingly describe the 
dynamics of about ten energy indices. 

Here, for the sake of simplicity, we consider a three-dimensional system whose coeffi- 
cients resemble those identified by O. Bardou: for 1 < i < 3, we set 

dXl = [a,{Xl - Ki)+ + /3,] ABl + - XI) At, 

with Xl = 0.29, Xl = 0.89, X^ = 0.62. 
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Fixed this way, the diffusion term does not satisfy Hypothesis (H2). We thus slightly 
modify the equation and consider 

dXl = [a,(j){Xl - K,) + P,] dB\ + - XI) dt, 

where 0(a;) = if x < 0, = 2.bx'^ if < x < 0.2 and = x - 0.1 if a; > 0.2. In 
this very simplified model, the components of X arc independent; in the real situation 
where one observes energy indices, one should take correlated Brownian motions _B*, as 
in Section 7.1. 

We set Vi = Hi ^ ai ^ \ . The drift term then stabilizes the process around the value 1. 
If j3i~Q, the process {XI) diffuses only when XI is above the threshold Ki. 

As above, we have approximated a path of the system by simulating the Eulcr scheme 
with stepsize 10"'' between times and 10. The observations are at times k ■ 10~^, 1 < 




Figure 7. Test case 7.2: /3i = /32 = 1, = 0, Ki=K2 = 3, K3 = 0.9. 




Figure 8. Test case 7.2: /3i = 1, /32 = /33 = 0, Ki = K2 = K3 = 0.9. 
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k < 1000. In view of (45), wc consider the estimators 

St V ; l"(i)2' ^* ^ ' L"(2)3/2 



and 



In Figures 7-10, the left boxes show a particular path of {X^,X'^, X"^). The right boxes 
show the corresponding paths of (j) (solid line) and ^j(j) (dashed line) for j = 1 and 
j = 2, respectively the top and the bottom curves. Values on the right-hand side vertical 
axes denote ^^(1) and Ct(2)- Moreover, on the right, we have box and whiskers for the 
empirical quartiles of £,f (1) (top) and (2) (bottom), computed from 500 independent 
paths and for all integer times i = 2, 3, . . . , 10. The whiskers extend to the extremal values 
of the samples, with the other ticks denoting the 1%, 10%, 90% and 99% quantiles. 
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In Figure 7, the two first components diffuse from time to time 10 since Ki and 
K2 are large. The third component diffuses only a little since it is attracted to 1 and 
— A'3) = '/>(0.1) is small. Given the threshold pn = 0.01, the explicative Brownian 
dimension i?„^t is 2 since ^(1) takes values around 0.2, whereas ^(2) takes values 
around 2 • 10"^. 

In Figure 8, the two last components have a small diffusion term. As both (1) and 

(2) take values less than 0.02, according to the same threshold pn = 0.01 as above, the 
explicative Brownian dimension Rn,t is 1. 

In Figure 9, we have Ki large and </)(l - K2) = 0(1 - K3) = 0(0.4) = 0.3. Therefore, 
none of the diffusion terms can be neglected, but the first component 'oscillates' more 
than the two others. It is a case where observed paths, for which both (1) and (2) 
take values around 0.1, may make it difficult to decide whether the explicative Brownian 
dimension should be chosen as 1 or 2. 

Finally, in Figure 10, we keep the first two components as in Figure 9, but change the 
third into an almost constant process. Of course, as above, we have difficulties in deciding 
whether the Brownian dimension is 1 or 2. However, it is clear that it cannot be 3 since 

(2) fluctuates around 5 • 10~^. 



7.3. Sensitivity to a drift term close to be a martingale 

We now consider a model with a strongly oscillating drift term. These oscillations signif- 
icantly decrease the efficiency of our estimator. In certain circumstances, the explicative 
Brownian dimension overestimates the real dimension. 
The system under consideration is 



77Cos(0X2)ds, 



/o 

where i? is a one-dimensional standard Brownian motion and rj, 9 are positive real 
numbers. 

As above, we have approximated a path of X by simulating the Euler scheme with 
stepsize 10"'' between times and 10. The observations are at times k- 10^^, 1 < fc < 1000. 

In view of (45), we consider the estimator ^(1) ~ tT^7~^- Observe that Lt{2) ~ 0, so 

that ^ (1) should be close to 0. 

The boxes and whiskers denote the empirical quartiles of (1) and extend to the 
extremal values, at times t = 2, 3, . . . , 10 and for 500 estimations of (1). 

Figure 11 shows a case with such a highly oscillating coefficient that the first component 
is close to 0. The reason is clear since 

1 2?7 2n 
^* =^-7jCos(0SO 

(59) 

p Jo 
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and, here, r] = 10 and 9 = 100. As our estimator takes values around 0.02, choosing 
Pn = 0.02 leads one to choose Rn.t = 1 as the explicative Brownian dimension. 

In Figure 12, we fix 7y = 10 and 9 = 10. In view of (59), [X^) is close to being a 
stochastic integral whose covariation with B on the time interval [0, 10] is small. As our 
estimator now takes values around 0.2, we are led to choose 2 as the explicative Brownian 
dimension and thus to overestimate the real Brownian dimension. 

In Figure 13 we fix 77 = 10 and 9 — 1. As our estimator takes values around 0.2, we 
again overestimate the real Brownian dimension. 

In Figure 14, as 77 = 1 and 9—1, the first component oscillates 'reasonably'. The 
estimator takes values less than 0.01 and we are led to correctly choose 1 as the explicative 
Brownian dimension. 

Figure 15 shows histograms of (1) in the various preceding situations. Finally, Figure 
16 shows the influence of the sampling frequency in the case exhibited in Figure 14: we 
see that in this case, the Brownian dimension remains correctly estimated as 1 when the 
step size remains smaller than 10^^. 



g 




T/n ^ 10- 




T/n = 5 ■ 10 




2 4 6 8 10 2 4 6 8 10 

T/n = iO"' T/ii = 5- 10"' 



Figure 16. Test case 7.3: ^"(1) in terms of T/n (77 = 1, 6 = 1). 
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